Improve perf of Utf8Parser.TryParse for int64 and uint64 #52423

GrabYourPitchforks · 2021-05-07T00:08:19Z

Somewhat inspired by investigating #52314 and seeing a little bit of low-hanging fruit that we can address.

Perf results (basically from running this benchmark with some additional inputs):

| Method | Toolchain | value | Mean | Error | StdDev | Ratio |
|--------|-----------|-------|-----:|------:|-------:|------:|
| TryParseInt64 | baseline | -1 | 3.73 ns | 0.022 ns | 0.018 ns | 1.00 |
| TryParseInt64 | compare | -1 | 3.25 ns | 0.025 ns | 0.021 ns | 0.87 |
| TryParseInt64 | baseline | -12345 | 5.93 ns | 1.680 ns | 0.092 ns | 1.00 |
| TryParseInt64 | compare | -12345 | 5.85 ns | 1.097 ns | 0.060 ns | 0.99 |
| TryParseInt64 | baseline | -12345678901234567 | 15.75 ns | 43.214 ns | 2.368 ns | 1.00 |
| TryParseInt64 | compare | -12345678901234567 | 13.24 ns | 0.371 ns | 0.020 ns | 0.85 |
| TryParseInt64 | baseline | -9223372036854775808 | 19.35 ns | 0.326 ns | 0.305 ns | 1.00 |
| TryParseInt64 | compare | -9223372036854775808 | 17.12 ns | 0.085 ns | 0.076 ns | 0.88 |
| TryParseInt64 | baseline | 0 | 3.17 ns | 0.097 ns | 0.005 ns | 1.00 |
| TryParseInt64 | compare | 0 | 3.21 ns | 0.602 ns | 0.033 ns | 1.01 |
| TryParseInt64 | baseline | 1 | 3.27 ns | 0.060 ns | 0.003 ns | 1.00 |
| TryParseInt64 | compare | 1 | 3.21 ns | 0.322 ns | 0.017 ns | 0.98 |
| TryParseInt64 | baseline | 12345 | 5.76 ns | 0.738 ns | 0.040 ns | 1.00 |
| TryParseInt64 | compare | 12345 | 5.07 ns | 0.929 ns | 0.050 ns | 0.88 |
| TryParseInt64 | baseline | 12345678901234567 | 13.90 ns | 0.247 ns | 0.013 ns | 1.00 |
| TryParseInt64 | compare | 12345678901234567 | 12.85 ns | 1.504 ns | 0.082 ns | 0.92 |
| TryParseInt64 | baseline | 9223372036854775807 | 17.72 ns | 0.102 ns | 0.095 ns | 1.00 |
| TryParseInt64 | compare | 9223372036854775807 | 16.11 ns | 0.063 ns | 0.059 ns | 0.91 |

Some of the benchmarks were going a bit screwy on my machine, so I basically ran them one at a time and concatenated all of the results into a single table. Not sure what was up with my environment.

The tl;dr is that this shouldn't have any worse perf than the existing code, and the total codegen ends up being 448 bytes smaller (on x64) compared to baseline across the two methods.
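For readers unfamiliar with the API being measured, here is a minimal sketch of the call under test. The `ParseInt64` helper and its tuple shape are illustrative only and are not part of this PR; the inputs come from the table above.

```csharp
using System;
using System.Buffers.Text;
using System.Text;

// Illustrative helper (not from the PR): parse a signed 64-bit integer
// from UTF-8 bytes using the default format.
static (bool ok, long value, int consumed) ParseInt64(string text)
{
    byte[] utf8 = Encoding.UTF8.GetBytes(text);
    bool ok = Utf8Parser.TryParse(utf8, out long value, out int consumed);
    return (ok, value, consumed);
}

Console.WriteLine(ParseInt64("-12345"));
Console.WriteLine(ParseInt64("9223372036854775807"));
Console.WriteLine(ParseInt64("-9223372036854775808"));
```

`consumed` reports how many bytes of the input were used, which is the `bytesConsumed` value the diff in this PR assigns at the end of the method.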

ghost · 2021-05-07T00:08:25Z

Tagging subscribers to this area: @tannergooding, @pgovind, @GrabYourPitchforks
See info in area-owners.md if you want to be subscribed.


Author:	GrabYourPitchforks
Assignees:	-
Labels:	`area-System.Buffers`, `tenet-performance`
Milestone:	6.0.0

gfoidl · 2021-05-07T08:06:25Z

...ies/System.Private.CoreLib/src/System/Buffers/Text/Utf8Parser/Utf8Parser.Integer.Signed.D.cs

+
+                if ((uint)firstChar == unchecked((uint)('-' - '0')))
+                {
+                    sign--; // set to -1


Suggested change:

-                    sign--; // set to -1
+                    sign = -1;

Make it explicit? The JIT will emit code like `mov rax, 0xffffffffffffffff` anyway.

GrabYourPitchforks · 2021-05-07

The JIT emits `dec rax` for this scenario. I think it doesn't propagate the `if (idx != 0) { FAIL; }` assertion above for some reason.
I can set `sign = -1;` explicitly, but it increases the codegen by 7-8 bytes. Maybe it's too much of a micro-optimization to worry about and the cleaner code is better?

gfoidl · 2021-05-07

So I'd keep it as is. Thanks for the info.
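The comparison against `'-' - '0'` in the diff can look odd in isolation. A minimal sketch of the shape, assuming (this is not shown in the excerpt) that `firstChar` already holds the first byte minus `'0'` and that `sign` starts at 0; `DetectSign` is a hypothetical name used only for illustration:

```csharp
using System;

// Hypothetical helper mirroring the shape of the check in the diff.
static long DetectSign(byte first)
{
    long sign = 0;                 // assumption: 0 means "non-negative"
    int firstChar = first - '0';   // assumption: digit offset already subtracted

    // '-' is 45 and '0' is 48, so '-' - '0' is -3; cast to uint it wraps
    // to 4294967293, which can never collide with a digit value 0..9.
    if ((uint)firstChar == unchecked((uint)('-' - '0')))
    {
        sign--;                    // 0 -> -1, as in the diff
    }
    return sign;
}

Console.WriteLine(DetectSign((byte)'-'));
Console.WriteLine(DetectSign((byte)'7'));
```

Under these assumptions, the decrement and an explicit `sign = -1;` produce the same value; the thread above is only about which form yields smaller codegen.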

gfoidl · 2021-05-07T08:11:07Z

...ies/System.Private.CoreLib/src/System/Buffers/Text/Utf8Parser/Utf8Parser.Integer.Signed.D.cs

+            // If sign = -1, this becomes value = (parsedValue ^ -1) - (-1) = ~parsedValue + 1 = -parsedValue
+
+            bytesConsumed = idx;
+            value = ((long)parsedValue ^ sign) - sign;


🚀

The same could be done to other types too (like int32 above, etc.)?
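The identity in the code comment above can be checked in isolation. A small sketch, assuming `parsedValue` is a `ulong` and `sign` is either 0 or -1 (the excerpt does not show the declared types); `ApplySign` is a hypothetical name:

```csharp
using System;

// Branchless conditional negation.
// sign ==  0: (x ^ 0)  - 0    ==  x
// sign == -1: (x ^ -1) - (-1) == ~x + 1 == -x   (two's complement)
static long ApplySign(ulong parsedValue, long sign)
    => unchecked(((long)parsedValue ^ sign) - sign);

Console.WriteLine(ApplySign(12345, 0));
Console.WriteLine(ApplySign(12345, -1));
// The magnitude of long.MinValue does not fit in a positive long,
// but the wrapped arithmetic still lands on the right value.
Console.WriteLine(ApplySign(9223372036854775808UL, -1) == long.MinValue);
```

The `unchecked` wrapper is added here so the sketch behaves the same even if the project enables overflow checking.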

stephentoub

Thanks. Are there any similar gains to be had in {u}long.TryParse?

jeffhandley · 2021-06-12T00:41:33Z

@GrabYourPitchforks Is this ready to merge?

jeffhandley · 2021-07-03T03:27:14Z

> @GrabYourPitchforks Is this ready to merge?

Ping on this, @GrabYourPitchforks. It would be great to get this in for Preview 7.

GrabYourPitchforks · 2021-07-12T23:17:43Z

I investigated the framework's other parsing routines, but they have a bit more complexity, needing to handle whitespace and trailing nulls. Opened #55541 so we don't lose track of this.
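To illustrate the extra cases mentioned above, here is a rough comparison of the two entry points on inputs with whitespace and a trailing null. This is a sketch of observable behavior as I understand it, not a description of how either parser is implemented; `StringOk` and `Utf8Ok` are hypothetical helper names.

```csharp
using System;
using System.Buffers.Text;
using System.Text;

// Hypothetical helpers: report only whether parsing succeeded.
static bool StringOk(string text)
    => long.TryParse(text, out long _);

static bool Utf8Ok(string text)
    => Utf8Parser.TryParse(Encoding.UTF8.GetBytes(text), out long _, out int _);

// long.TryParse tolerates surrounding whitespace and trailing nulls.
Console.WriteLine(StringOk(" 42 "));
Console.WriteLine(StringOk("42\0"));

// Utf8Parser expects the number to start at the first byte.
Console.WriteLine(Utf8Ok("42"));
Console.WriteLine(Utf8Ok(" 42"));
```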

Improve perf of Utf8Parser.TryParse(out [u]long, default)

70a1064

GrabYourPitchforks added the `area-System.Buffers` and `tenet-performance` (Performance related issue) labels May 7, 2021

GrabYourPitchforks added this to the 6.0.0 milestone May 7, 2021

GrabYourPitchforks requested review from adamsitnik, pgovind and tannergooding May 7, 2021 00:08

gfoidl reviewed May 7, 2021

stephentoub approved these changes May 25, 2021

jeffhandley assigned GrabYourPitchforks Jul 3, 2021

GrabYourPitchforks merged commit f787f38 into dotnet:main Jul 12, 2021

GrabYourPitchforks mentioned this pull request Jul 12, 2021

Investigate improving perf of int16, int32, int64 parsing #55541

Open

ManickaP mentioned this pull request Jul 20, 2021

[QUIC] Remove AppContext switch from S.N.Quic #56027

Merged

ghost locked as resolved and limited conversation to collaborators Aug 12, 2021
